Abstract. We present a model for growing information networks where the ageing of 
a node depends on the time at which it entered the network and on the last time it was 
cited. The model is shown to undergo a transition from a small-world to large-world 
network. The degree distribution may exhibit very different shapes depending on the 
model parameters, e.g. delta-peaked, exponential or power-law tailed distributions. 

1. Introduction 

The ageing of nodes is an important process for understanding the way information 
or social networks grow [H [2l El IH [6]. For instance, this process may be responsible 
for deviations from scale-free degree distributions [1] or for the non-vanishing values of the 
clustering coefficient observed in many networks El E]. Ageing accounts for the fact 
that old nodes lose their ability to acquire new links as time goes on, thereby limiting the 
number of active nodes to a small fraction of the whole network. In general, this effect 
embodies the notion of a generation for social agents, the lifetime of a piece of information or of 
an article, etc. Such effects may be taken into account by attributing an age $\tau$ to nodes 
[2] and by assuming that their probability of receiving a link from a newly entering node 
depends on their age (through some decreasing function of $\tau$) and, possibly, on other 
parameters such as their degree $k$ (preferential attachment pUl HTJ). An alternative 
model [3 El E] assumes that nodes can be deactivated with a probability proportional to 
$k^{-1}$. In this deactivation model (DM), once a node is deactivated, it is excluded from the 
network dynamics. DM is appealing because it mimics the fact that less popular nodes 
are more easily forgotten than popular ones. This is the case for citation networks 
[13], for instance (nodes are the articles and directed links are the citations of one article by 
another), where highly cited papers usually continue to be cited for a 
long time, and vice versa. For example, papers with more than 100 citations have an average 
citation age of 11.7 years, while publications with more than 1000 citations have an 
average citation age of 18.9 years [13]. Unfortunately, DM is unsatisfactory because the 
underlying mechanism for this deactivation probability $\sim k^{-1}$ is not identified, and a 
more fundamental model is therefore of interest. 

A similar lack of clarity also occurs when one tries to justify linear preferential 
attachment models [10, 12]. Indeed, the latter imply that entering nodes have a global 
knowledge of the network, i.e. they must be aware of the degree of every previously 
existing node before connecting to one of them. This unrealistic assumption can be 
elegantly circumvented by introducing redirection [14, 15] or copying [16, 17, 18, 19] 
mechanisms. In the simplest version, which we explain in terms of citation networks for 
the sake of clarity, an author who is writing the reference list for a new paper picks a 
random pre-existing paper. Then the author cites either the randomly selected paper 
(with probability $1 - r$) or one of the references within that paper (with probability $r$). 
It is straightforward to show that this purely local process generates linear preferential 
attachment [14]. In this Article, we proceed along the same line of thought and 
introduce a model, called the Link Activation Model (LAM), that includes ageing effects. 
Its interpretation is quite natural for information networks, such as citation networks. 
The system is a growing network where, for the sake of simplicity, entering nodes have 
only one outgoing link (each paper cites one other paper). One assumes that only 
recent nodes are active but, contrary to previous models, a node is active if it has been 
introduced recently or if it has been cited recently. In detail, when an author cites a 
paper, the author first selects either the latest paper (the paper that entered at the previous 
time step) with probability $p$ or a random paper with probability $1 - p$. Then, with probability 
$r$, the author cites the paper cited by the selected paper; with probability $1 - r$, the author 
cites the selected paper itself. The model therefore depends on two parameters, $p$ and $r$, that 
measure the importance of the ageing and redirection processes as compared to random 
effects. Four different events can take place at each time step, 
as summarised in Fig. 1. An applet allowing the dynamical visualisation of the model 
should also be available online [20]. Let us stress that the ingredients of the model are 
very general and that LAM is not limited to citation networks, but should also apply 
to other information networks, e.g. the Web. 

Figure 1. Four possible configurations when a new node enters the network. The 
latest node is darkened and the entering node is in white. With probability $p(1 - r)$, 
the latest node receives the link from the entering node. With probability $pr$, the 
latest node is selected, but redirection takes place, so that the father of the latest 
node receives the link from the entering node. The two other possible configurations, 
associated with the random selection of a node (in this example, node 3), occur with 
probabilities $(1 - p)(1 - r)$ and $(1 - p)r$. 
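The growth rule described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the author's implementation: the function names (`simulate_lam`, `in_degrees`, `heights`) and the `seed` argument are hypothetical, the random node is assumed to be drawn uniformly among all existing nodes, and the seed's self-link is assumed not to count as an incoming link.

```python
import random


def simulate_lam(steps, p, r, seed=None):
    """Grow a LAM network for `steps` time steps and return the parent list.

    parent[i] is the node cited by node i; node 0 is the seed and cites itself.
    """
    rng = random.Random(seed)
    parent = [0]  # the seed has an outgoing link to itself
    for _ in range(steps):
        latest = len(parent) - 1
        # Selection: latest node with probability p, otherwise a uniformly random node.
        if rng.random() < p:
            selected = latest
        else:
            selected = rng.randrange(len(parent))
        # Redirection: with probability r, cite the node cited by the selected node.
        target = parent[selected] if rng.random() < r else selected
        parent.append(target)
    return parent


def in_degrees(parent):
    """Number of incoming links of every node (assumption: the seed's self-link is ignored)."""
    deg = [0] * len(parent)
    for child, target in enumerate(parent):
        if child != 0:
            deg[target] += 1
    return deg


def heights(parent):
    """Height of every node, i.e. its distance to the seed."""
    h = [0] * len(parent)
    for node in range(1, len(parent)):
        h[node] = h[parent[node]] + 1  # a cited node always has a smaller index
    return h
```

With `p = 1` and `r = 0` every node cites its predecessor and the sketch produces a chain; with `p = 1` and `r = 1` every node ends up citing the seed, i.e. a star.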

Before going further, let us specify the notation. Initially ($t = 0$), the network is 
composed of one node, the seed. For the sake of coherence, the seed has an outgoing link 
connected to itself. At each time step $t$, a new node enters the network. Consequently, 
the total number of nodes is equal to $N_t = 1 + t$, and the number of links is also 
$L_t = 1 + t$. 

2. Height distribution 

In this section, we focus on the height distribution, the height of a node [21] being 
defined as the length of the shortest path between this node and the seed. Let us 
denote by $H_{g;t}$ the average number of nodes at height $g$. By construction, $H_{0;t} = 1$ for all 
times. We also define $l_{g;t}$ to be the probability that the latest node is at height $g$. It is 
straightforward to show that these quantities satisfy the coupled rate equations 

$$
\begin{aligned}
H_{g;t+1} &= H_{g;t} + (1-p)\,\frac{(1-r)H_{g-1;t} + rH_{g;t}}{t+1} + p\left[(1-r)\,l_{g-1;t} + r\,l_{g;t}\right], \\
l_{g;t+1} &= (1-p)\,\frac{(1-r)H_{g-1;t} + rH_{g;t}}{t+1} + p\left[(1-r)\,l_{g-1;t} + r\,l_{g;t}\right],
\end{aligned}
\tag{1}
$$

except for $g = 1$: 

$$
l_{1;t+1} = (1-p)\,\frac{H_{0;t} + rH_{1;t}}{t+1} + p\left(l_{0;t} + r\,l_{1;t}\right) \tag{2}
$$

(the same gain term enters the equation for $H_{1;t+1}$), 
and for $g = 0$, where one has the trivial solutions $H_{0;t} = 1$ and $l_{0;t} = 0$ (this is due to 
the fact that an entering node can only arrive at height 1 or higher). The above rate 
equations are derived in the usual way and generalise the equation with $p = 0$ found 
in [22], for instance. It is straightforward to verify that $N_t = \sum_g H_{g;t} = t + 1$ and 
$l_t = \sum_g l_{g;t} = 1$. 

Let us first focus on the case $p = 1$, where only latest nodes are selected, and take a 
continuous time limit (this is justified a posteriori as we are interested in the long time 
behaviour of the model). In that case, one has to solve 

$$
\begin{aligned}
\partial_t H_{g;t} &= (1-r)\,l_{g-1;t} + r\,l_{g;t}, \\
\partial_t l_{g;t} &= (1-r)\,l_{g-1;t} + (r-1)\,l_{g;t}.
\end{aligned}
\tag{3}
$$

In the following, we are interested in the behaviour of the average total height $G_t = 
\sum_{g=0}^{\infty} g\,H_{g;t}$. To do so, one also needs to evaluate the behaviour of $z_t = \sum_{g=0}^{\infty} g\,l_{g;t}$, which 
is the average height of the latest node and is easily found to satisfy 

$$\partial_t z_t = (1-r). \tag{4}$$

Consequently, $z_t$ asymptotically behaves like $(1-r)t$ and the equation for the total 
height $G_t$ reads 

$$\partial_t G_t = (1-r) + (1-r)t. \tag{5}$$

This equation leads to the asymptotic behaviour $G_t = \frac{(1-r)}{2}\,t^2$. This implies that the 
average height $g_t = G_t/(t+1) \simeq G_t/t$ asymptotically increases linearly with time. 
Moreover, the redirection process slows down the growth of the average height (see Fig. 2). 
This is expected as redirection favours the connection to nodes closer to the seed. In 
the limiting case $r = 1$, where the process is easily shown to lead to a star network (i.e. 
all the nodes are connected to the seed), one finds $G_t = t \Leftrightarrow g = 1$. 
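As a numerical cross-check of the height analysis, the discrete-time rate equations can be iterated directly. The sketch below is an illustration under stated assumptions: the function names are hypothetical, the seed is taken to be the latest node at $t = 0$, and a redirection from the seed is assumed to lead back to the seed through its self-link, so that the entering node then lands at height 1.

```python
def iterate_heights(steps, p, r):
    """Iterate the discrete-time rate equations for the height distribution.

    Returns (H, l): H[g] is the average number of nodes at height g and
    l[g] the probability that the latest node is at height g, after `steps` steps.
    """
    H = [1.0] + [0.0] * steps  # only the seed, at height 0
    l = [1.0] + [0.0] * steps  # assumption: initially the seed is the latest node
    for t in range(steps):
        n_nodes = t + 1
        # s[g]: probability that the selected node is at height g
        s = [(1 - p) * H[g] / n_nodes + p * l[g] for g in range(steps + 1)]
        q = [0.0] * (steps + 1)  # height distribution of the entering node
        for g in range(1, steps + 1):
            q[g] = (1 - r) * s[g - 1] + r * s[g]
        # A redirection from the seed leads back to the seed (self-link),
        # so the entering node then lands at height 1.
        q[1] += r * s[0]
        H = [H[g] + q[g] for g in range(steps + 1)]
        l = q
    return H, l


def total_height(H):
    """Average total height G = sum over g of g * H[g]."""
    return sum(g * h for g, h in enumerate(H))
```

For `p = 1`, dividing `total_height(H)` by the number of nodes should give an average height that grows roughly like $(1-r)t/2$ at large $t$, while for `p < 1` the growth should be much slower.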




Figure 2. Typical realisations of the model when $p = 1$. In that case, the average 
height evolves linearly with time and one observes a large range of behaviours, from an 
aligned network ($r = 0$) to a star network ($r = 1$). The average height $g_t$ increases in 
a large-world way, i.e. linearly with time, $g_t = \frac{(1-r)}{2}\,t$. 

Let us now focus on the more general case $p < 1$, which reads in the continuous time 
limit 

$$
\begin{aligned}
\partial_t H_{g;t} &= (1-p)\,\frac{(1-r)H_{g-1;t} + rH_{g;t}}{t+1} + p\left[(1-r)\,l_{g-1;t} + r\,l_{g;t}\right], \\
\partial_t l_{g;t} &= (1-p)\,\frac{(1-r)H_{g-1;t} + rH_{g;t}}{t+1} + p\left[(1-r)\,l_{g-1;t} + r\,l_{g;t}\right] - l_{g;t}.
\end{aligned}
\tag{6}
$$

By multiplying these equations by $g$, summing over $g$, using 

$$
\sum_{g=0}^{\infty} g\,\frac{(1-r)H_{g-1;t} + rH_{g;t}}{t+1} = (1-r) + \frac{G_t}{t+1} \tag{7}
$$

and neglecting terms $\sim t^{-1}$, one obtains the following set of equations for the above 
defined average quantities 

$$
\begin{aligned}
\partial_t G_t &= (1-p)\left(1 - r + \frac{G_t}{t}\right) + p\,(1 - r + z_t), \\
\partial_t z_t &= (1-p)\left(1 - r + \frac{G_t}{t}\right) + p\,(1 - r + z_t) - z_t.
\end{aligned}
\tag{8}
$$

It is easy to simplify Eqs. (8) into: 

$$
\begin{aligned}
t\,\partial_t G_t &= (1-r)\,t + (1-p)\,G_t + p\,t\,z_t, \\
t\,\partial_t z_t &= (1-r)\,t + (1-p)\,G_t + (p-1)\,t\,z_t.
\end{aligned}
\tag{9}
$$

Numerical integration of the above set of equations and our knowledge of the previous 
simplified cases (e.g. Eq. 5) suggest looking for solutions of the form $G_t = C\,t\log(t)$, 
$z_t = C\log(t) + K$. By inserting these expressions into Eqs. (9) and keeping leading terms 
in the long time limit $t \gg 1$, one finds the conditions 

$$C = K = \frac{1-r}{1-p}, \tag{10}$$

which cease to be valid when $p = 1$, in agreement with the solution of Eq. 5. 
Consequently, the average height $g_t$ asymptotically grows logarithmically with time, 
$g_t = \frac{1-r}{1-p}\log(t)$. This result should be compared with the linear regime $g_t = \frac{(1-r)}{2}\,t$ 
taking place when $p = 1$. Let us stress that such a transition from a large-world ($g_t \sim t$) 
to a small-world [231 121] ($g_t \sim \log(t)$) network has already been observed in another 
model with ageing [9] and is associated with the cross-over from a structured network, 
reminiscent of a one-dimensional line, to an unstructured network. The above solution 
is in agreement with the prediction $g_t = (1 - r)\log(t)$ taking place in a model without 
ageing [22]. 
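For reference, the leading-order algebra behind the logarithmic solution can be written out explicitly. The block below is a sketch that restates the time-multiplied equations for $G_t$ and $z_t$ and inserts the ansatz $G_t = C\,t\log t$, $z_t = C\log t + K$ quoted in the text; it only keeps the terms of order $t\log t$ and $t$.

```latex
% Ansatz: G_t = C t \log t, \quad z_t = C \log t + K
\begin{align*}
t\,\partial_t G_t &= C\,t\log t + C\,t, &
(1-r)\,t + (1-p)\,G_t + p\,t\,z_t &= C\,t\log t + \left[(1-r) + pK\right] t, \\
t\,\partial_t z_t &= C, &
(1-r)\,t + (1-p)\,G_t + (p-1)\,t\,z_t &= \left[(1-r) - (1-p)K\right] t .
\end{align*}
% The t log t terms match identically; matching the terms of order t gives
\begin{align*}
(1-r) - (1-p)K = 0 \;&\Rightarrow\; K = \frac{1-r}{1-p}, \\
C = (1-r) + pK \;&\Rightarrow\; C = \frac{1-r}{1-p}.
\end{align*}
```

In the second equation the left-hand side is of order one, so the coefficient of $t$ on the right-hand side must vanish, which fixes $K$; the first equation then fixes $C$.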



3. Degree distribution 

Let us denote by $N_{k;t}$ the average number of nodes with $k$ incoming links. For the sake of 
clarity, we first focus on three simplified cases, $r = 0$, $p = 0$ and $p = 1$, before deriving 
results for general values of the parameters. 

When $r = 0$, there is no possible redirection and the only stochastic mechanism takes 
place during the selection of a node. With probability $p$, the latest node, which has 
by definition zero incoming links, receives the link of the entering node, while with 
probability $1 - p$, a random node receives this link. Consequently, the rate equation for 
$N_{k;t}$ [25] reads 

$$\partial_t N_k = (1-p)\,\frac{N_{k-1} - N_k}{N_t} + p\,(\delta_{k,1} - \delta_{k,0}) + \delta_{k,0}, \tag{11}$$

where the last delta term accounts for the degree distribution of the newly entering node. 
We look for a stationary solution of the distribution $n_k = N_k/N_t$, which is determined 
by the recurrence relations 

$$(1-p)(n_{k-1} - n_k) + p\,(\delta_{k,1} - \delta_{k,0}) + \delta_{k,0} - n_k = 0. \tag{12}$$

Its solution is easily found to be 

$$
\begin{aligned}
n_0 &= \frac{1-p}{2-p}, \\
n_1 &= \frac{1-p}{2-p}\,n_0 + \frac{p}{2-p}, \\
n_k &= \frac{1-p}{2-p}\,n_{k-1} \quad \text{for } k > 1.
\end{aligned}
\tag{13}
$$

When $p = 0$, one recovers the exponential solution $n_k = (1/2)^{k+1}$. For increasing 
values of $p$, the tail of the distribution remains exponential, but its core is more and 
more peaked around $k = 1$. In the limiting case $p = 1$, the solution goes to a peaked 
distribution $n_k = \delta_{k,1}$ that corresponds to an aligned network (see Fig. 2). 
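The recurrence for $r = 0$ is simple enough to evaluate directly. The following sketch (the function name `stationary_r0` and the truncation parameter `kmax` are hypothetical) builds the stationary distribution from the three relations of the solution above, assuming `kmax >= 1`.

```python
def stationary_r0(p, kmax):
    """Stationary in-degree distribution n_0..n_kmax for r = 0 (assumes kmax >= 1)."""
    ratio = (1 - p) / (2 - p)
    n = [ratio]                              # n_0
    n.append(ratio * n[0] + p / (2 - p))     # n_1
    for _ in range(2, kmax + 1):
        n.append(ratio * n[-1])              # n_k for k > 1
    return n
```

For instance, `stationary_r0(0.0, 5)` reproduces the exponential law $(1/2)^{k+1}$, and `stationary_r0(1.0, 5)` puts all the weight on $k = 1$.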

In the case $p = 0$, LAM reduces to the usual model with redirection, for which it is 
well known [14] that the degree distribution evolves as 

$$\partial_t n_k = r\left[(k-1)\,n_{k-1} - k\,n_k\right] + (1-r)(n_{k-1} - n_k) + \delta_{k,0} - n_k. \tag{14}$$

The stationary solution is therefore found by recurrence 

$$(rk + 2 - r)\,n_k = (rk + 1 - 2r)\,n_{k-1}. \tag{15}$$

This stationary solution has a power-law tail $k^{-\nu}$ whose exponent $\nu$ is obtained by 
inserting the form $n_k \sim k^{-\nu}$ into the above equation. By keeping the leading terms in 
$k^{-1}$, i.e. $(k-1)^{-\nu} = k^{-\nu}(1 - 1/k)^{-\nu} \simeq k^{-\nu}(1 + \nu/k)$, one has to solve 

$$(rk + 2 - r)\,k^{-\nu} = (rk + 1 - 2r)\,k^{-\nu}(1 + \nu/k), \tag{16}$$

so that one recovers the value $\nu = \frac{1+r}{r}$ derived in [14]. 
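The power-law tail for $p = 0$ can be checked numerically by iterating the stationary recurrence and measuring the local slope of $n_k$ on a log scale. This is a sketch under stated assumptions: the function names are hypothetical, the starting value $n_0 = 1/(2 - r)$ is obtained here by setting $k = 0$ (with $n_{-1} = 0$) in the stationary rate equation, and $0 < r < 1$ is assumed so that all $n_k$ are positive.

```python
import math


def stationary_p0(r, kmax):
    """Stationary distribution for p = 0 from (rk + 2 - r) n_k = (rk + 1 - 2r) n_{k-1}."""
    n = [1.0 / (2.0 - r)]  # assumption: n_0 from the k = 0 stationary equation
    for k in range(1, kmax + 1):
        n.append(n[-1] * (r * k + 1 - 2 * r) / (r * k + 2 - r))
    return n


def local_exponent(n, k):
    """Estimate nu in n_k ~ k^(-nu) from the values at k and 2k."""
    return -math.log(n[2 * k] / n[k]) / math.log(2.0)
```

For `r = 0.5`, evaluating `local_exponent(stationary_p0(0.5, 20000), 10000)` should give a value close to $(1 + r)/r = 3$.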

The case $p = 1$ is slightly more complicated, due to the fact that the selected node 
is always the latest node. Consequently, one also has to focus on the quantity $A_k$, which 
is the average number of nodes with degree $k$ that are cited by the latest node. By 
construction, this quantity satisfies $\sum_k A_k = 1$ (because there is only one latest node 
and this latest node has only one outgoing link), and the system is described 
by the coupled set of equations 

$$
\begin{aligned}
\partial_t N_k &= r\,(A_{k-1} - A_k) + (1-r)(\delta_{k,1} - \delta_{k,0}) + \delta_{k,0}, \\
\partial_t A_k &= r\,A_{k-1} + (1-r)\,\delta_{k,1} - A_k.
\end{aligned}
\tag{17}
$$

Let us note that the equations for $N_k$ and $A_k$ are quite similar, except for their loss term. 
This is due to the fact that all nodes that do not receive a link at a given time step are cited 
by nodes that, by construction, cease to be the latest node. The stationary values of $A_k$, 

$$
A_0 = 0, \qquad A_k = r^{k-1}(1-r) \quad \text{for } k > 0, \tag{18}
$$

and of the distribution $n_k$, 

$$
n_0 = r, \qquad n_k = r^{k-1}(1-r)^2 \quad \text{for } k > 0, \tag{19}
$$

are found by recurrence. In the case $r = 0$, one recovers the distribution $n_k = \delta_{k,1}$ 
of the aligned network. Before going further, let us stress that LAM exhibits a very 
rich phenomenology, with a degree distribution that can behave like a delta peak, an 
exponential or a power-law depending on the parameters. 
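The stationary solution for $p = 1$ can be verified by substituting it back into the rate equations. The sketch below uses hypothetical function names and assumes that stationarity means $\partial_t N_k = n_k$ (because $N_k = n_k N_t$ with $N_t = t + 1$) and $\partial_t A_k = 0$; both lists of residuals should then vanish up to rounding.

```python
def stationary_p1(r, kmax):
    """Stationary A_k and n_k for p = 1, for k = 0..kmax."""
    A = [0.0] + [r ** (k - 1) * (1 - r) for k in range(1, kmax + 1)]
    n = [r] + [r ** (k - 1) * (1 - r) ** 2 for k in range(1, kmax + 1)]
    return A, n


def residuals_p1(r, A, n):
    """Residuals of the stationary rate equations for N_k and A_k."""
    res_n, res_A = [], []
    for k in range(len(n)):
        d0 = 1.0 if k == 0 else 0.0  # Kronecker delta_{k,0}
        d1 = 1.0 if k == 1 else 0.0  # Kronecker delta_{k,1}
        A_prev = A[k - 1] if k > 0 else 0.0
        res_n.append(r * (A_prev - A[k]) + (1 - r) * (d1 - d0) + d0 - n[k])
        res_A.append(r * A_prev + (1 - r) * d1 - A[k])
    return res_n, res_A
```

Both distributions are geometric in $k$, which is the exponential regime mentioned in the text.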



Activity ageing in growing networks 






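By putting together the contributions of the above limiting cases, it is 
straightforward to write a set of equations for general values of $p$ and $r$: 

$$
\begin{aligned}
\partial_t N_k &= (1-p)\left[r\,\frac{(k-1)N_{k-1} - kN_k}{N} + (1-r)\,\frac{N_{k-1} - N_k}{N}\right] \\
&\quad + p\left[r\,(A_{k-1} - A_k) + (1-r)(\delta_{k,1} - \delta_{k,0})\right] + \delta_{k,0}, \\
\partial_t A_k &= (1-p)\left[r\,\frac{(k-1)N_{k-1}}{N} + (1-r)\,\frac{N_{k-1}}{N}\right] \\
&\quad + p\left[r\,A_{k-1} + (1-r)\,\delta_{k,1}\right] - A_k,
\end{aligned}
\tag{20}
$$

whose stationary solutions are found by solving the recurrence relations 

$$
\begin{aligned}
0 &= (1-p)\left[r\left((k-1)\,n_{k-1} - k\,n_k\right) + (1-r)(n_{k-1} - n_k)\right] \\
&\quad + p\left[r\,(A_{k-1} - A_k) + (1-r)(\delta_{k,1} - \delta_{k,0})\right] + \delta_{k,0} - n_k, \\
0 &= (1-p)\left[r\,(k-1)\,n_{k-1} + (1-r)\,n_{k-1}\right] \\
&\quad + p\left[r\,A_{k-1} + (1-r)\,\delta_{k,1}\right] - A_k.
\end{aligned}
\tag{21}
$$

It is possible to write the formal solution of the second relation: 

$$
\begin{aligned}
A_0 &= 0, \\
A_k &= (pr)^{k-1}\,p\,(1-r) + \sum_{i=1}^{k} (pr)^{i-1}(1-p)\left[r\,(k-i) + (1-r)\right] n_{k-i}.
\end{aligned}
\tag{22}
$$

After inserting this solution into the first equation of Eqs. (21), looking for a solution of the 
form $n_k \sim k^{-\nu}$ and keeping the leading terms in $k^{-1}$, it is straightforward but lengthy 
to get the expression 

$$\nu = \frac{1 + r - 2pr}{r - pr}. \tag{23}$$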

This solution is well defined when $p \neq 1$ and $r \neq 0$, and recovers the result derived 
above when $p = 0$. It is important to note that the tail of the distribution behaves like 
a power-law for any other value of the parameters. Let us also note that Eq. (23) is a 
monotonically increasing function of $p$, for fixed values of $r$, so that ageing mechanisms 
have a tendency to diminish the number of nodes with very high degrees. This can be 
understood by noting that ageing diminishes the probability for old nodes to be cited, 
while these old nodes are typically those with the highest degree. 
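The general exponent can be compared with a direct iteration of the stationary recurrences for $n_k$ and $A_k$. The sketch below is illustrative only: the function names are hypothetical, the recurrences are solved forward in $k$ starting from $A_0 = 0$ and from the value of $n_0$ given by the $k = 0$ stationary equation, and the tail exponent is estimated from the values at $k$ and $2k$, assuming $0 < r \le 1$ and $p < 1$.

```python
import math


def stationary_general(p, r, kmax):
    """Iterate the stationary recurrences for n_k and A_k up to k = kmax."""
    # k = 0 equation with A_0 = 0 (assumption: n_{-1} = A_{-1} = 0)
    n = [(1 - p * (1 - r)) / (1 + (1 - p) * (1 - r))]
    A = [0.0]
    for k in range(1, kmax + 1):
        d1 = 1.0 if k == 1 else 0.0  # Kronecker delta_{k,1}
        gain = (1 - p) * (r * (k - 1) + (1 - r)) * n[k - 1]
        A_k = gain + p * (r * A[k - 1] + (1 - r) * d1)
        n_k = (gain + p * (r * (A[k - 1] - A_k) + (1 - r) * d1)) / (
            1 + (1 - p) * (r * k + 1 - r)
        )
        A.append(A_k)
        n.append(n_k)
    return n, A


def nu_predicted(p, r):
    """Tail exponent nu = (1 + r - 2pr) / (r - pr)."""
    return (1 + r - 2 * p * r) / (r - p * r)


def local_exponent(n, k):
    """Estimate nu in n_k ~ k^(-nu) from the values at k and 2k."""
    return -math.log(n[2 * k] / n[k]) / math.log(2.0)
```

For `p = r = 0.5` the predicted exponent is 4, and the local slope measured at large $k$ should approach this value; increasing `p` at fixed `r` raises `nu_predicted`, in line with the monotonic behaviour noted in the text.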



4. Discussion 



In this Article, we have presented a simple model for growing networks with ageing. This 
Link Activation Model incorporates the fact that articles remain present in the collective 
memory as long as they are cited or read. Namely, the articles that are the most likely to be 
cited are those that have been published recently or those that have been cited recently, 
in other words, all sorts of articles that are currently visible and may have momentarily triggered 
the reader's curiosity. This natural process is shown to lead to a rich behaviour for 
the network structure, including a transition from a large-world to a small-world 
network. Moreover, various kinds of asymptotic stationary degree distributions may 
be reached depending on the model parameters: a delta peak that corresponds to a 
one-dimensional lattice, exponential-like distributions or power-law tailed distributions. 
Let us emphasise that LAM is quite general and should apply to many situations 
involving a competition between multiplicative effects (rich-get-richer) and ageing. 
Apart from the citation networks discussed above, one may think of short-lived 
information web pages. A typical example is digg.com, where users propose 
news items or articles that are then subject to the votes of the whole community of users. 
Usually, such items lose their appeal within a few hours or days. 